Round 1 - Technical Interview
Work and Conceptual Related:
๐นTell me about yourself, kind of projects and tech stack used.
๐นHow does your day to day work look like.
๐นWhy are you using the tech stack you are using
๐นWhat is the alternative to Medallion Architecture
๐นWhat is the kind and size of data you deal with on daily basis
๐นIf the business is using JSON as the file format and you have to convince the business to use Parquet. How are you going to convince the Business?
PySpark Problem:
๐นGiven the Dataframe, how are you going to split the data into two columns ('Even', 'Odd'), where are even numbers are populated to Even column and Odd numbers are populated into Odd column.
Python Problem:
๐นGiven an array, find the min and max within the array.
SQL Problem:
๐นGiven a table with column 'Country', select the data as below sequence.
Table: Matches:
Col: Country
India
Australia
Pakistan
O/P:
India vs Australia
India vs Pakistan
Australia vs Pakistan
๐นGiven 2 tables as below. Find the count of records for Left Outer Join and Inner Join respectively:
A:
1
1
1
1
B:
1
1
1
๐นNote the values for DenseRank() and Rank() output for below data:
I/P:
85
85
80
75
75
70
Round 2 - Technical Interview
Work and Conceptual Related:
๐นTell me about yourself, kind of projects and tech stack used.
๐นExplain the architecture of Spark
๐นExplain the process how jobs run in Spark.
๐นSome follow up questions like what does Catalyst Optimizer do.
๐นWhat is the difference between Logical Plan and Physical Plan
๐นDifference between ORC and Parquet
PySpark Problem:
๐นRead a CSV file and create a dataframe with properties.
๐นCreate a dataframe with two columns with default String and default Integer respectively.
Python Problem:
๐นGiven a string, output the count of each word into a dictionary:
I/P: string = 'aaabbbccddeeeee'
O/P: Dict = { โaโ : 3 , โbโ : 3, โcโ : 2, โdโ :2 , โeโ :5 }
๐นWrite a python program to count occurrence of an input string in a file . e.g. find the number of
occurrences of the word โTheโ in the sentence โThe lazy fox jumps over the sleeping rabbit. The lazy
rabbit doesnโt wake upโ. has context menu
SQL Problem:
๐นGiven two tables, output the result of Inner, left, right, full Joins respectively.
Table1:
col1
1
1
Table2:
col1
b
a
1
Round 3 - Hiring Manager
Work and Conceptual Related:
๐นTell me the difference between Datalake and Deltalake
๐นWhy did you quit the job?
๐นEven though you have an offer in hand, why did you apply again?
๐นAre you willing to join if we offer same as the offer you have in hand
๐นAre you willing to relocate Bangalore, even though you are settled in Hyderabad. Why?